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FOREWORD 


This publication covers statistical concepts and procedures used in bulk oil 
measurement. 

Suggested revisions are invited and should be submitted to the director of the 
Measurement Coordination Department, American Petroleum Institute, 1220 L 
Street, N.W., Washington, D.C. 20005. 
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Chapter 13—Statistical Aspects of Measuring and Sampling 


SECTION 1—STATISTICAL CONCEPTS AND PROCEDURES IN MEASUREMENT 


13.1.0 


The nature of physical measurements makes it impos- 
sible to measure a physical variable without error. Abso- 
lute accuracy is only achievable when it is possible to 
count the objects or events; even then, when large num- 
bers are involved, it may be necessary to approximate. 
With the best equipment and directions, the potential 
for errors in fluid volume measurements involving large 
amounts of material is large. 

Minimizing errors, estimating the remaining errors, 
and keeping all parties informed of errors is increasingly 
important to the petroleum industry. Equally important 
is an understanding of the size and significance of errors. 
Providing estirnates of errors and statements concerning 
errors in a standard form can help avoid disputes and 
dispel delusions of accuracy in statements of quantity. 

Chapter 13 of the Manual of Petroleum Measurement 
Standards is designed to help those who make measure- 
ments of bulk oil quantities improve the value of their 
result statement by making proper estimates of the un- 
certainty or probable error involved in measurements. 
During the development of Chapter 13.1, reference was 
made to Part XIV, Section | (Tentative) of the Petroleum 
Measurement Manual published by the Institute of Pe- 
troleum, London, England. 


Introduction 


13.1.1 Scope 


This chapter covers the basic concepts involved in es- 
timating errors by statistical techniques and ensuring 
that results are quoted in the most meaningful way. The 
statistical procedures that should be followed in estimat- 
ing a true quantity from one or more measurements and 
in deriving the range of uncertainty of the results are dis- 
cussed. Sources of error are examined and examples are 
provided showing how a statement of the overall uncer- 
tainty in completed measurements is derived. 

The subsequent sections (in preparation at the time 
this section was published) of Chapter 13 will deal with 
the application of the concepts discussed in Section 1 to 
various methods for bulk oil measurement widely used 
in the petroleum industry. Chapter 13.1 is a reference 
document explaining theory and the application of sta- 
tistical procedures whereas subsequent sections will pro- 
vide statistical equations and typical examples for 
various types of measurement. 
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13.1.2 Definitions 


The following terms are used throughout Chapter 13. 

Accuracy is the ability to indicate values closely ap- 
proximating the true value of the measured variable. 

Bias is any influence on a result that produces an in- 
correct approximation of the true value of the variable 
being measured. Bias is the result of a predictable sys- 
tematic error. 

Confidence interval or range of uncertainty, C, is the 
range or interval within which the true value is expected 
to lie with a stated degree of confidence. 

Confidence level is the degree of confidence that may 
be placed on an estimated range of uncertainty. 

Degrees of freedom is the number of independent re- 
sults used in estimating the standard deviation. 

Direct measurement is a measurement that produces a 
final result directly from the scale on an instrument. 

Error is the difference between true and observed 
values. 

Indirect measurement is a measurement that produces 
a final result by calculation using results from one or 
more direct measurements. 

Mean, X, is the average of two or more observed 
values. 

Measurement is a procedure for determining a value 
for a physical variable. 

Normal (Gaussian) distribution (see Appendix A). 

The observed value is the result obtained from a 
measurement. 

An outlier is a result that differs considerably from the 
main body of results in a set.. 

Parameters are the values that characterize and sum- 
marize the essential features of measurements. 

Precision is the degree to which data within a set clus- 
ter together. 

A random error is an error that varies in an unpredict- 
able manner when a large number of measurements of 
the same variable are made under effectively identical 
conditions. 

Range, w, is the region between the limits within 
which a quantity is measured. 

Repeatability, r, is a measure of the agreement be- 
tween the results of successive measurements of the 
same variable carried out by the same method, with the 
same instrument, at the same location, and within a 
short period of time. 
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Reproducibility is a measure of the agreement between 
the results of measurements of the same variable where 
individual measurements are carried out by the same 
methods, with the same type of instruments, but by dif- 
ferent observers, at different locations, and after a long 
period of time. 

A result is the observed value of a variable determined 
by a single measurement. 

A spurious error is a gross error in procedure (for ex- 
ample, human errors or machine malfunctions). 

Standard deviation, s, is the root mean square devia- 
tion of the observed value from the average. 

Standard normal deviate (see Appendix A). 

Student’s t is a statistical function that varies in mag- 
nitude with degrees of freedom. 

A systematic error, e, is one that, in the course of a 
number of measurements made under the same condi- 
tions, on material having the same true value of a vari- 
able, either remains constant in absolute value and sign 
or varies in a predictable manner. Systematic errors re- 
sult in a bias. 

True value, X, is the correct value of a variable. 

Variance, V or v, is the measure of the dispersion or 
scatter of the values of the random variable about the 
mean p. 


13.1.3 Nomenclature 


The following algebraic symbols are used throughout 
Chapter 13. 


A True limit of range of uncertainty for 
random errors. 

Estimate of A. 

True limit of range of uncertainty for 
systematic errors. 

Estimate of B. 

True total limit of range of uncertainty. 
Estimate of C. 

Conversion factor (used to derive s from w). 
Estimate of systematic error. 

Number of repeated measurements. 
Number of quantities incorporated in a final 
indirect quantity measurement. 
Number of independent sources of 
systematic error. 

Constants. 

Estimate of repeatability. 

True value of standard deviation. 
Estimate of standard deviation. 

Value of Student’s ¢ distribution. 

True variance, S?. 

Estimate of variance, s?. 

Range of a set of data. 
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True value of a variable. 

Observed mean value of a set of data. 
Observed value of a variable. 

Observed mean value corrected for bias. 
Observed value of a variable corrected for 
bias. 

Mean of Gaussian normal distribution. 
Standard deviation of a Gaussian normal 
distribution. 

@ Degrees of freedom. 
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13.1.4 Statistical Control 


Proper use of statistical techniques requires that the 
measurement process be in a state of statistical control. 
Unless this is achieved, any statement concerning the es- 
timate of the true value of the quantity being measured, 
and the statistical uncertainty associated with it, is not 
strictly valid and may even be meaningless. A measure- 
ment process that is under statistical control will, if 
measurements on the same quantity are repeated by the 
same method and under essentially the same conditions, 
show stability of the mean value and regular scatter of 
individual results (see also 13.1.7). 

Repeatability and reproducibility, when properly es- 
tablished, can be used to monitor statistical control on a 
routine basis (see 13.1.7.1 and 13,1.7.2). 

Strict statistical control is usually very difficult to en- 
sure. An important step in establishing any measure- 
ment procedure is to decide which variables should be 
used to monitor statistical control and to establish target 
values required to maintain an appropriate degree of 
consistency. Some essential elements in statistical con- 
trol are listed here. 


1. The entire measurement procedure and instructions 
must be clearly defined and closely followed. 

2. Independent procedures for checking and maintain- 
ing equipment must be available. 

3. Means for detecting and eliminating equipment 
malfunctions and human mistakes (leading to spurious 
errors) should be incorporated (see 13.1.6.1). 


These features of the measurement procedure must be 
adhered to at all times. Furthermore, control charts and 
other records of equipment performance, maintenance, 
and calibration checks must be used as an integral part 
of statistical control procedures. 


13.1.5 Measurements 
13.1.5.1 TRUE VALUE 


One primary assumption is made; that is, an exact or 
true value exists for any variable, valid for the condi- 
tions that exist at the moment when the result is deter- 
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mined. Generally, the true value X cannot be 
determined, but a valid estimate x can be obtained by 
rigorous application of the appropriate method of meas- 
urement using the specified instruments. By statistical 
analysis of the various errors involved, it is possible to 
use observed values to obtain an estimate of the true 
value and to quantify the reliability of that estimate. In 
any set of measurements, the best estimate of X will be 
the mean X after rejecting outliers and correcting for sys- 
tematic errors. 


13.1.5.2 UNCERTAINTY OF MEASUREMENT 


The usefulness of a result is greatly increased when it 
is accompanied by a statement of its reliability. The sta- 
tistical calculations provided in this chapter give a range 
or interval within which the true value of the variable 
can be expected to lie with a stated degree of confidence. 
The statistical term for such an interval is the confidence 
interval (also referred to as the range of uncertainty of the 
measurement). The limits of a confidence interval about 
an estimate X are expressed as x +C(x); the magnitude 
of --C() depends on the random variability of the 
measurements, unknown systematic errors, and the con- 
fidence level. As an example, consider the following 
statement: 10° --1°C. In this statement, the estimate x 
is 10° and the confidence interval is 1°. 


13.1.5.3 CONFIDENCE LEVEL 


Setting absolute limits to a range of uncertainty is 
rarely possible. It is more practical to give an indication 
of the degree of confidence that may be placed on an 
estimated range of uncertainty. This degree of confi- 
dence, or confidence level, indicates the probability that 
the range quoted will include the true value of the quan- 
tity being measured. The most common statistical prac- 
tice is to use the 95 percent confidence level. This level 
implies that there is a 95 percent probability (19 chances 
in 20) that the true value will lie within the stated range. 
The 95 percent level is recommended for all commercial 
applications in petroleum measurement and will be used 
throughout this chapter. In certain limited circum- 
stances, a different degree of confidence may be 
required. 

NotE: Strictly, a confidence level or confidence interval can only be 
used to account for Gaussian random errors or errors that may be so 
treated. Systematic errors must be accounted for before the confidence 


level and interval are applied, and substantial contributions to the to- 
tal uncertainty should be separately recorded. 


13.1.5.4 REPORTING RESULTS 


All results should be reported so that the estimate of 
the true value, and the limits within which the true value 
is expected to lie with a given level of confidence, can be 
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seen at a glance. Results should be written as follows: 
y EC) 95, n (95 percent confidence level, n measure- 
ments) from which the following relevant information 
can be obtained: 


]. }is a mean value of n measurements, is corrected for 
all known systematic errors, and is the estimate of the 
true value. 

2. There is 95 percent probability that the true value 
lies between y — C(y) and y + C(j). 

3. There is 95 percent probability that any further sin- 
gle measurement will ihe within py = = C(y)/n. 


Expanding on the example, assume that the following 
temperature measurements were taken for a delivery 
batch of crude oil: 10, 8, 11, 9, and 12°C; and the confi- 
dence interval was determined to be + 2°C, Then y = 
10, n = 5 and the result statement would be: 10° + 2°C 
(95 percent confidence level, 5 measurements). 


13.1.6 Types of Errors 


The difference between the observed value of a vari- 
able and its true value includes all errors associated with 
the person actually taking and recording the results, in- 
strument errors, procedure errors, and errors resulting 
from sampling procedures or changes in conditions dur- 
ing the period of measurement. There are three basic 
types of errors that must be considered: spurious errors, 
systematic errors, and random errors. 


13.1.6.1 SPURIOUS ERRORS 


Spurious errors are gross errors, such as misapplica- 
tion of method, incorrect reading or recording, and in- 
strument malfunction. These errors cannot be incor- 
porated into any statistical analysis and the results must 
be discarded. 

There are statistical methods of testing for outliers 
(see Appendix B), but these methods should only be ap- 
plied if there is good reason to believe that spurious er- 
rors exist. Data should not be discarded lightly, and the 
«observer should record what information has been dis- 
-carded and state the reasons. 


13.1.6.2 SYSTEMATIC ERRORS 


A systematic error is one that, in the course of a 
number of measurements made under the same condi- 
tions on the same variable, remains constant or varies 
predictably. Thus, systematic errors cause bias in the re- 
sults. The bias can be positive or negative, leading to 
over- or underestimation of the true value of the variable 
being measured. In many liquid measurement applica- 
tions, systematic errors may make a larger contribution 
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to the overall uncertainty of a result than random errors 
(see 13.1.6.3), Systematic errors must be identified and 
either eliminated or compensated for before interpreting 
overall results statistically (see note in 13.1.5.3). 

Ideally, bias could be considered as constant for all 
measurements made by the same operator and equip- 
ment. Unfortunately, assessment of bias is complicated 
by the fact that some contributions to bias do vary with 
time. For example, knowledge and control of test condi- 
tions may be inadequate or an instrument may wear pro- 
gressively. Such factors will probably not change 
significantly during the course of one set of measure- 
ments but could change both in magnitude and sign over 
a longer period. 

Assessment of systematic errors by experimental 
means is difficult, especially when variation with time is 
involved. Errors introduced by the observer or by 
changes in operating conditions are probably easiest to 
identify, but any experimental evaluation of systematic 
error may involve a complete change of equipment, 
which is often not feasible. The alternative to experimen- 
tation is to make a subjective assessment on the basis of 
experience and knowledge of the instruments involved. 

In any event, if the conditions of measurement are un- 
changed, increasing the number of measurements will 
not reduce the effects of systematic error. 

All conceivable sources of error must be identified, ex- 
amined methodically, and assessed quantitatively to es- 
tablish whether they make a significant contribution to 
bias. 


13.1.6.3 RANDOM ERRORS 


Random errors are caused by small independent infiu- 
ences that prevent a repeated measurement from giving 
an identical result, although the true value of the vari- 
able involved remains the same, Results that contain 
only random errors are amenable to statistical analysis. 
Random errors are assumed to follow a normal (Gaus- 
sian) distribution, described in Appendix A. Provided 
that all systematic errors can be accounted for, the cor- 
rected mean value y and the range of uncertainty --C(y) 
can be calculated. Increasing the number of measure- 
ments reduces the value of C() and hence improves the 
reliability of the final estimate y. 


13.1.7 Accuracy and Precision 


A set of measurements subject to the smallest system- 
atic errors will be expected to have their mean closest to 
the true value, and this is said to give the most accurate 
set. It is also evident that the set of measurements sub- 
ject to the smallest random errors will be expected to be 
clustered closest together, and so form the most precise 
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set. Within these rather narrow statistical definitions, 
precise measurements are not necessarily accurate, since 
they could cluster about a point that is not the true 
value. Conversely, it is possible to have a set of measure- 
ments that are accurate taken as a group although 
widely scattered and of doubtful reliability when taken 
singly. 

This distinction is important since any statement con- 
cerning reliability must account for both systematic er- 
rors and random errors, as statistically defined. In 
practice, accuracy in measurement cannot exist without 
precision, so every effort must be made to satisfy both 
criteria of reliability. 

The precision of a method of measurement can be de- 
termined quantitatively and is conventionally expressed 
as repeatability and reproducibility, 


13.1.7.1. REPEATABILITY 


The repeatability of a method of bulk measurement is 
a quantitative measure of the random error associated 
with a single operator at a given location, obtaining suc- 
cessive measurements on the same body of material over 
a short time interval, with the same measuring devices, 
and under constant operating conditions. Repeatability 
is defined as the difference between two such measure- 
ments that would be exceeded in the long run in 1 case 
in 20 in the normal and correct operation of the method 
of measurement. Repeatability is the range of uncer- 
tainty (95 percent confidence level) for the difference be- 
tween two measurements obtained under the same 
conditions. 

The short time interval between measurements is es- 
sential to ensure that external conditions are kept as 
nearly constant as practicable. The time interval should 
be of the same order of magnitude as the duration of a 
single measurement, For example, if a measurement 
takes 5 minutes to carry out, the interval before a second 
measurement should not exceed 10 minutes. 


13.1.7.2. REPRODUCIBILITY 


The reproducibility of a bulk measurement method is 
a quantitative expression of the random error associated 
with different operators working in different locations 
with different instruments, with each operator obtaining 
single measurements on the same, body of material by 
using the same method and the same types of measuring 
devices. It is defined as the difference between two such 
single and independent measurements that would be ex- 
ceeded in the long run in only | case in 20 in the normal 
and correct operation of the method of measurement. 
Reproducibility is the range of uncertainty (95 percent 
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confidence level) for the difference between two meas- 
urements obtained under the same conditions. 

Good reproducibility indicates that random errors are 
acceptably small (good repeatability) and that system- 
atic errors other than those inherent in the method are 
probably also very limited in size and number. Repro- 
ducibility conditions as defined, can rarely be met in 
quantitative bulk oil measurement because the identity 
of a body of oil is almost invariably lost during its move- 
ment from one place to another (the only possible excep- 
tion being measurement by gage or weighing of a vehicle 
or ship). However, a close approximation to reproduc- 
ibility conditions could be achieved for measurements 
such as gaging a storage tank if two operators each set 
up their own apparatus for the prescribed method of 
measurement at the same location. 


13.1.7.3 APPLICATION OF PRECISION TO A 
SINGLE MEASUREMENT 


As stated previously, reproducibility of a measure- 
ment method, strictly defined, is not a concept that can 
often be utilized. The application of repeatability is also 
limited in normal commercial transfer measurement, 
since the second measurement required to establish a 
difference between two results is not a practical proposi- 
tion in everyday work using meters. 

In practice it is necessary to conduct a special exercise 
and obtain repeated determinations of the result with 
the apparatus that is to be used at a given site and to use 
these determinations to estimate the random error of a 
single measurement. This random error is expressed as 
the range of uncertainty about a single measurement 
rather than the range for the difference between two 
measurements as would be the case for repeatability and 
reproducibility. This estimate of the range of uncertainty 
is then used for all routine measurements until such time 
as a complete re-check of the apparatus and method is 
undertaken (usually at prescribed regular intervals). 

The estimated range of uncertainty so obtained would 
cover errors both from the instruments used and from 
the calibration system employed. It should also be noted 
that, without repeated measurements, it is impossible to 
use the range of uncertainty as a means of monitoring 
statistical control on a short-term basis, as was the case 
with repeatability (see 13.1.7.1). 


13.1.8 Statistical Procedures 


True value and range of uncertainty are two impor- 
tant characteristics describing the measurement of any 
physical variable. There are characteristics that describe 
other features of the results, such as random error, stan- 
dard deviation, and bias, and these are called the param- 
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eters of the population of observed variables. Parameters 
are all assumed to have true values. The procedures de- 
scribed in this section are used to derive estimates of the 
parameters, known as statistics, from the set of measure- 
ments obtained. Parameters will be represented algebrai- 
cally either by Greek letters (for example, «4 and o) or 
true values by capital Roman letters, and observed val- 
ues will be represented by lowercase Roman letters. 

In general, the result in question will be a function of 
one or more intermediate results, each of which could 
contribute to both the final result estimate and its range 
of uncertainty. The statistics for each intermediate result 
should be established first. Intermediate results will be 
combined to give the statistics that relate to the final 
result. 


13.1.8. STATISTICAL PROCEDURE FORA 
SINGLE SET OF DATA 


Statistics are derived from a single set of n repeated 
measurements x;, for i = 1 to n. Each measurement will 
be an estimate of X, the truce value of the variable but 
will be subject to both systematic and random errors 
(see 13.1.6). All known sources of systematic error 
should be accounted for before the true value and range 
of uncertainty are estimated. In the interest of clarity, it 
is good practice to record the source and magnitude of 
each error separately. 

Random errors are assumed to follow the normal 
(Gaussian) distribution (see 13.1.6.3 and Appendix A), 
which is fully determined if its parameters y (mean) and 
o (standard deviation) are determined. These two pa- 
rameters are estimated from the measurements obtained. 
The possible sources and magnitudes of the systematic 
errors to be found in measuring systems are given in de- 
tail in 13.1.8.1.1 through 13.1.8.1.7. 


13.1.8.1.1| Number of Repeated Measurements 


Required 


There is no fixed value for the optimum number of 
measurements required to establish a true value and a 
range of uncertainty. On the one hand, , the number of 
repeated measurements, has no bearing on the deterimi- 
nation of systematic errors that are present to the same 
extent in all measurements made under the same operat- 
ing conditions (see 13.1.7.2). On the other hand, the sta- 
tistics relating to random errors (for example, mean and 
standard deviation) are not independent of n, since the 
larger n becomes, the closer estimates will approach 
their true values and the smaller will be the range of un- 
certainty (see 13.1.7.3). 

Very often it is only practical to obtain from five to 
ten measurements in the field. This is perfectly accepta- 
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ble for the day-to-day estimate of a mean value, but 
greater reliability is required for a statistic that is to be 
used as a standard measure. This is the case for 
repeatability (see 13.1.7.1), which should be estimated 
from at least 20 and preferably 30 or more repeated 
measurements. A similar argument applies when esti- 
mating the range of uncertainty for single measurements 
(see 13.1.7.3). 


13.1.8.1.2 Outlying Results 


Results that are subject to spurious errors (see 
13.1.6.1) may differ considerably from the remaining re- 
sults in the set. These are called outlying results. If a re- 
sult is suspected to be an outlier but is not easily 
identifiable, then the set of results should be tested for 
outliers according to the procedures given in Appendix 
B. The suspect result should be discarded if the test 
proves significant. It should be stressed, however, that a 
good reason is required before a result is rejected, and 
that reason should be clearly stated. 

When the repeatability of the method of measurement 
has been established, it is possible to make a preliminary 
check for outliers by the range test illustrated in 
13,1.8.1.7. Note that constant systematic errors cannot 
be detected in an outlier test because they are present to 
the same extent in all results of the quantity in question 
(see 13,1.6.2). 


13.1.8.1.3 Correcting for Bias 


If a constant systematic error e is known to exist, for 
example, a depth gage is known to give a consistent bias 
1 millimeter above the true reading due to faulty calibra- 
tion, then each of the measurements x, should be ad- 
justed accordingly. The adjusted results, y,, will then be 
the most accurate available (see 13.1.7) and are given by 
the expression: 


Vi =k, e (1) 
Note that the systematic error could be level dependent, 


that is, a constant function (for example, percentage) of 
the measurement itself: 


e = f(x) (2) 


An example of this would be a ‘direct reading meter 
known from experience to give a consistent bias 1 per- 
cent above the true value..In that case, Equation 1 
becomes: 


Yi = Xi — Fx) (3) 


There are times, however, when the systematic error is 
unknown in magnitude and/or sign, usually due to vari- 
ations in operating conditions over a long interval (see 
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13.1.6.2). In that case, the average systematic error @ 
should be estimated, taking into account the conditions 
that affect the measurements at the time. Very often the 
only way to estimate the average is to calculate the mean 
range in which the errors could lie. If the errors were 
estimated to range from e, to @., the average systematic 
error would be: 


@ = (e, + e,)/2 (4) 


Individual measurements should then be adjusted by the 
mean value e as in Equation 5: 


Wry —e (5) 
Note that when the systematic error takes positive or 
negative values up to the same maximum (e, = —e,) no 


correction will be made. Note also that unknown sys- 
tematic errors will contribute to the range of uncertainty 
for the true value estimate (see 13.1.8.1.6.3). 


13.1.8.1.4 Estimating True Value 


The results y, (x, corrected for bias) are now subject to 
random errors and unknown systematic errors. As previ- 
ously stated, the measurements y, are assumed to follow 
the normal distribution with mean ps and standard devia- 
tion a. The estimate of the mean that is most likely to be 
correct, or the “maximum likelihood” estimate of p, is 
the average y of the set of corrected measurements: 


i Ms 


1 1 
Y= Ont Ye to + In) = re ve (©) 


If only one result is available, the result is the estimate of 
the true value. 


13.1.8.1.5 Estimating Standard Deviation 


The standard deviation o (y) describes the random er- 
ror of a single measurement. 

The maximum likelihood estimate s(y) of the standard 
deviation is calculated from the set of corrected results 


(;) as follows: 
A - 
Pr ea 
i=l] 


or 


ll 


sv) 


— 


s(') THI = ye (7) 


i=1 


A less complicated but more approximate estimate is: 
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w 
s(y) av D(n) (8) 
Where: 
w = the range of the set of measurements (for 1 
< 12). 
D(n) = a conversion factor (see Table 1). 


A further approximation can be made by replacing D(n) 
by (n)**. It should be stressed, however, that Equation 8 
is approximate since it should theoretically apply to the 
average range w of a number of sets of n measurements. 
A more reliable estimate would be obtained from the av- 
erage range of six pairs of results than from the range of 
a single set of twelve repeated results. For this reason, 
the equation should only be used as a quick check to 
monitor statistical control and not for data interpreta- 
tion (see 13.1.4). 

The standard deviation of the average of n repeated 
results can be calculated as: 


= o(y) 
oF) = Te ©) 
In terms of estimates, the standard deviation, or as it 
is more commonly called, the standard error, of the aver- 
age becomes: 


sp) = (10) 
As the number of measurements increases, the stan- 
dard error of the average will decrease. Therefore, an av- 
erage based on a large number of measurements would 
in this sense be more reliable than one based on a small 
number of measurements (see 13.1.8.1.1). Furthermore, 
since the distribution of any average tends toward the 
normal as n becomes larger, Equation 10 would still hold 
true if the distribution of individual results deviated 
from the normal distribution. 


13.1.8.1.6 Estimating Range of Uncertainty 


For a measurement function, here called G, the limit 
C(G) of the range of uncertainty (see 13.1.5.2) consists of 
two parts, the limit A(G) due to random errors and the 
limit B(G) due to unknown systematic errors (see 
13.1.8.1.3). The estimation of A, B, and C depends to a 
large extent on the nature of G, be it a single measure- 
ment or an average, and on the nature of the errors pre- 
sent. (In this section, the expression “limit of the range 
of uncertainty” will often be referred to in shortened 
form as “limit of uncertainty” or “uncertainty limit.) 


Table 1—Range Conversion Factor 


Number of Conversion Number of Conversion. 

Measurements, Factor, Measurements, Factor, 
n D(n) n D(n) 
2 1,128 8 2.847 
3 1.693 9 2.970 
4 2.059 10 3.078 
5 2.326 li 3.173 
6 2.534 12 3.258 
7 2.704 


Source: Davies, O. L., Statistical Methods in Research and Production, 
2nd Edition, Longman, 1984. 


13.1.8.1.6.1| Uncertainty Due to Random Errors 


The limit A(y) of the range of uncertainty due to ran- 
dom errors about a single measurement y is simply the 
product of the standard deviation o(y) and the standard 
normal deviate (see Appendix A). For 95 percent 
probability, the standard normal deviate has a value of 
1.96, that is, 


A(y) = 1.96 oy) dh 


In general, the standard deviation will be estimated 
from Equation 7 as s(y). To take this into account, the 
limit of random uncertainty calculated from s(y) should 
be based not on the standard normal deviate, but on a 
value known as Student’s ¢, which varies in magnitude 
with the degree of freedom. For the purpose of this doc- 
ument, degrees of freedom may be regarded as the 
number of independent measurements used in estimat- 
ing the standard deviation, which for » measurements 
will be n — 1 (1 degree of freedom having been used in 
calculating the average). The limit of the range of uncer- 
tainty for single measurements (see 13.1.7.3), will, in this 
case, be estimated as: 


aly) = (tos, a1) SO”) (12) 


Where: 


tys. 21 = the value of the ¢-distribution for (7-1) 
degrees of freedom and for a two-sided 
probability of 95 percent (two-sided since 
the range of uncertainty covers both sides 
of the true quantity estimate). 


Values of the ¢-function are given in Table 2. Once 
again, by using Equation 10, the limit for an average will 
become: 


a(y) = (fg, nt) X s(y) 
= (bys. a1) X sy) (13) 


ns 
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Table 2—tDistribution Values for 95 Percent 


Probability (Double-Sided) 


Degrees Degrees 
of Freedom of Freedom 
$ bes, @ > tus, @ 
a 
1 12.706 18 2.101 
2 4,303 19 2.093 
3 3.182 20 2.086 
4 2.716 21 2.080 
5 2.571 22 2.074 
6 2.447 23 2.069 
7 2.365 24 2.064 
8 2.306 25 2.060 
9 2.262 26 2.056 
10 2.228 27 2.052 
il 2.201 28 2.048 
12 2.179 29 2.045 
13 2.160 30 2.042 
14 2.145 40 2.021 
15 2.131 60 2.000 
16 2.120 120 1.980 
17 2.110 - 1,960 


SOURCE: Fisher and Yates, Statistical Tables for Biological, Agricul- 
tural, and Medical Research. 


Thus, combining Equations 12 and 13: 


ay) = no 


It is worth noting that as n becomes very large, so the 
t-value approaches the standard normal deviate, and the 
standard deviation estimate s(y) approaches the true 
value o (y). 


13.1.8.1.6.2 Uncertainty Due to Systematic Errors 


Systematic errors can affect results by creating a bias, 
an uncertainty, or both (see 13.1.6.2). When the errors 
are known, be they level dependent or not, the bias can 
be removed according to 13.1.8.1.3 and no uncertainty 
will exist. On the other hand, when the errors are un- 
known in sign and/or magnitude and even though bias 
can be allowed for according to Equation 4, there will 
still be an uncertainty as to the true value of the variable. 
Nor is it theoretically possible, due to the very nature of 
systematic errors, to express the true limit B in terms of 
the measurements obtained. It is necessary, therefore, to 
estimate the limit for each source of systematic error by 
calculating the absolute value by which the corrected re- 
sults could deviate from their true value with 95 percent 
confidence. Assuming that systematic errors are uni- 
formly distributed and using the symbols defined in 
13.1.8.1.3, this means that: 


(15) 


— é 
2 


by) = 0.95 | 4 


“4. 
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Note that b(y) is the limit of a range of uncertainty, and 
should not be confused with the maximum value (e, or 
é,) that a systematic error could take. If the error takes 
positive or negative values up to the same maximum (e, 
= e.) then: 


b(y) = 0.95|e¢,| = 0.95] e, | (16) 


Note also that since systematic errors are present to the 
same extent for all measurements made under the same 
conditions (see 13.1.6.2), the limit of the range of uncer- 
tainty about an average result y will be identical, that is: 


By) = bly) (17) 


Although it is difficult to handle systematic errors 
with theoretical justification, this should not detract 
from their importance in measuring systems. In many 
cases, systematic errors create greater uncertainty than 
random errors, and for this reason, great care should be 
taken in their identification and estimation. 


13.1.8.1.6.3 | Combining Random and Systematic 
Uncertainties 


In attempting to allow for systematic uncertainties, 
difficulties will arise because systematic errors are often 
variable with time and cannot be identified from a single 
set of measurements obtained under constant operating 
conditions (see 13.1.6.2). This is not to say that system- 
atic errors cannot be estimated at all since good esti- 
mates can be derived from calibration exercises or from 
experience with and knowledge of the measuring system 
involved. A combination of uncertainties is required be- 
cause it is of great value to state the range in which a 
true value is expected to lie. 

There are two schools of thought on how uncertainty 
limits should be combined: (1) by simple addition or (2) 
by a method called root sum square. The latter method 
is theoretically correct if only random uncertainty limits 
are to be combined (see 13.1.8.2.2), but gives a narrower 
range and, therefore, a more optimistic view than the 
former method. In choosing between the two methods, 
consideration must be given not only to theoretical im- 
plications, but also to the manner in which errors are 
found to behave in practice. 

Consider first a set of measurements of the same 
quantity subject to p independent sources of systematic 
error, all of which are unknown but have been estimated 
according to 13,1.8.1.3. Since the errors affect the same 
measurement, they tend to cancel each other out, at least 
to a certain extent. With this in mind, it would be sensi- 
ble to take the more optimistic view and combine the 
systematic uncertainty limits by the root sum square 
method. Assuming that systematic errors follow a uni- 
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form distribution—that there is an equal probability that 
the error lies anywhere throughout its full range—there 
would be a theoretical justification for this choice. As a 
general rule, the total limit of the range of uncertainty 
due to systematic errors should be calculated as: 


by) = VbP(y) + BP) +. + by) (18) 


In this equation, the systematic uncertainties have 
been combined in exactly the same manner as random 
uncertainties (see 13.1.8.2.2). On a theoretical basis, sys- 
tematic and random uncertainties should be combined 
in the same fashion. There is further justification for this 
approach in practical terms, since systematic and ran- 
dom errors would be expected to average each other out 
to a certain extent. This leads to the root sum square 
method for combining systematic and random uncer- 
tainties, which, in terms of average measurements, is: 


cy) = Vay) + bX) (19) 


Note that a(y) becomes smaller as n becomes larger 
(see 13.1.8.1.1), whereas 5(y) will remain unchanged. 
The total limit c(y) will approach the limit b(y) as 7 in- 
creases. This shows the need for care in estimating sys- 
tematic errors, regardless of the number of repeated 
measurements obtained (see 13.1.6.2). Note also that by 
using, Equations 14 and 17, the limit of the range of un- 
certainty for any further single measurement (see 
13.1.5.4) becomes: 


oy) = Yna*{y) + by) (20) 


13.1.8.1.7 Estimating Repeatability 


Since repeatability is defined as the range of uncer- 
tainty due to random errors for the difference between 
two measurements (see 13.1.7.1), it can be estimated di- 
rectly from Equation 12. In this case, the standard devia- 
tion relates to the absolute difference between two 
repeated measurements, y, and y,, and for a normal dis- 
tribution of errors this is: 


o(ly, — el) = V2 0) = V20.) = V2) (21) 


In terms of estimates this becomes: 


sr — Ys) = V2s (y) (22) 


Substituting Equation 22 into Equation 12, the estimate 
r of repeatability will be given by: 


r= (bs. a) [V2 50] (23) 
tys. ne tS described in 13.1.8.1.6. This estimate can be 
compared with a predetermined repeatability value for 
control purposes. If r were excessively great, it would 
imply that measurements were subject to unusually large 
errors. Note that a repeatability estimate that is to be 
used as a standard measure should be based on as many 
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results as possible, at least 20 and preferably 30 or more 
(see 13.1.8.1.1) and would normally be calculated at the 
end of a carefully controlled study. 

Repeatability is most commonly used as a range test 
of the difference between two repeated measurements 
(see 13.1.4 and 13.1.7.1). It can also be used to construct 
a test on the range of three or more measurements. By 
combining Equations 23 and 8, the range can be repre- 
sented by: 


ie woe = (24) 


By substituting a previously determined repeatability 
value into this expression, a critical value can be calcu- 
lated for the range of a set of n measurements. However, 
it is advisable not to use this as a formal outlier test. 
Because the range represents only a part of the informa- 
tion on the variability within a set of measurements (that 
is, the smallest and largest values), the test will only be 
approximate. Nevertheless, it can be used to monitor 
statistical control within a set of measurements (see 
13.1.4 and 13.1.6.2) and flag the need for rigorous 
analysis. . 


13.1.8.2 STATISTICAL PROCEDURE FOR TWO 
OR MORE SETS OF DATA 


In some cases, the quantity in question is obtained in- 
directly from m intermediate and independent results, 
each of which will have been estimated from a separate 
set of data according to the procedures in 13.1.8.1. In 
this section, procedures are given in which the estimates 
for the intermediate quantities are combined to give 
those relating to the final quantity. 


13.1.8.2.1 Estimating True Value 


The value X of the final result is assumed to be a func- 
tion F of the m intermediate quantities X|, X, ... Xm- 
Algebraically, this can be represented by: 


X = F(X, Xy .. Xn) (25) 


The maximum estimate of X is obtained simply by 
substituting into Equation 25 the appropriate estimates 
for X,, X,,... Xm. In terms of measurements corrected 
for bias (see 13.1.8.1.3), the estimate y of the true vari- 


-able will become: 


Y = Fs Yor Ym) (26) 


As an example, consider a relationship between quan- 
tities of the form: 


X = PX,X. + QX, (27) 
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P and Q are known constants. The estimate of the final 
quantity, according to Equation 26, is then: 


y= PY, Yo + Qys (28) 


Note that the calculation resulting from such an equa- 
tion could give rise to a further source of systematic er- 
ror (see 13.1.8.2.3). This would be the case, for example, 
in estimating the volume of a tank from tables of liquid 
depth. The intermediate results would include assumed 
values for tank dimensions, with further assumptions on 
environmental conditions, and these could all contribute 
to unknown systematic errors. 


13.1.8.2.2 Combining Random Uncertainties 


Random error is represented statistically by the stan- 
dard deviation (sometimes called standard error) associ- 
ated with a particular measurement function (see 
13.1.8.1.5). It is useful when combining random errors to 
consider another parameter called variance. Standard 
deviation o is simply the square root of the variance V: 


V(X) = 0X) (29) 


In terms of estimates corrected for bias: 


Wy) = s*(y) (30) 


Any of the expressions dealing with standard devia- 
tion may be converted to the corresponding expressions 
for variance by substituting Equation 29 or 30. 

Now consider the random errors associated with the 
m intermediate quantitics, The variance of the indireci 
measurements of the final quantity is given approxi- 
mately by the expression: 


de Bec (ae) WX) (31) 


of/oX;, represents the partial differential coefficient of F 
with respect to X,, and F is Equation 25. oF/oX, may be 
regarded as the change in F brought about by unit 
change in X;. Equation 31 only holds true, however, if 
the quantities X,, X., ... X», are independent of each 
other. Furthermore, the equation leads to the root sum 
square method of combining random uncertainty limits 
(see 13,1.8.1.6.3), for by substituting into it Equations 11 
and 29, it becomes: 


An = VES ACK + [oe 403] +. + [Se 4m] (32) 
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The corresponding equation in terms of estimates will 
be: 


a3) = ibe ag) + Es i) and] 


(33) 


13.1.8.2.3 Combining Systematic Uncertainties 


As previously stated, there are theoretical difficulties 
when attempting to combine systematic uncertainties 
(see 13.1.8.1.6). The choice is between the arithmetic and 
root sum square methods of combining and should take 
into account the manner in which the errors behave in 
practice. This is sometimes difficult to judge, particularly 
for the multiplicative terms in a relationship (see Equa- 
tion 27). Assuming a uniform distribution of systematic 
errors, however, it is theoretically correct to combine the 
systematic errors in a multiplicative function by the root 
sum square method. This, coupled with the fact that sys- 
tematic errors combined in an additive fashion are ex- 
pected to cancel each other out to a certain extent, leads 
to the choice of the root sum square method as applica- 
ble in the general case. The appropriate formula is iden- 
tical in form to Equation 33 but with random 
uncertainty limits replaced by the corresponding system- 
atic limits: 


by) = 
(34) 


The point to remember when combining systematic 
uncertainties is that the relationship between quantities 
(see Equation 25) may only be approximate (see 
13.1.8.2.1). In that case, a further unknown systematic 
error could be present, and the corresponding uncer- 
tainty limit should be estimated according to Equation 
15. This should be included as another squared term.in 
the uncertainty expression (Equation 34). 


13.1.8.2.4 Estimating Total Uncertainty 


For the reasons already explained in 13.1.8.1.6.3, the 
random and systematic components of the total uncer- 
tainty should be combined by quadrature according to 
Equation 19. In this case, however, a(j) will be esti- 
mated from Equation 33 and b() from Equation 34. 


13.1.8.3 ROUNDING STATISTICALZ:ESTIMATES 


When applying the procedures of 13.1.8.1 and 
13.1.8.2, it is important to consider the effect of round- 
ing on the statistical estimates derived. Rounding that is 
too coarse will become a significant source of error. Any 
particular result will be reported to the smallest unit of 
measure of the instrument involved, and the statistics 


Ee oa) + ley 67) tot ieee bFn)} 
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that relate to that result should reflect this level of accu- 
racy and should be rounded accordingly. For example, a 
gage reading would be reported to the nearest millimeter 
if that was the scale unit of the tape measure. Estimates 
of the mean gage, standard deviation, and the limit of 
the range of uncertainty should also be rounded to the 
nearest millimeter, and the calculations leading up to 
those estimates should include a sufficient number of 
digits to achieve this. 

Particular care should be taken when considering 
more complicated functions, such as would be found in 
the indirect estimation of a parameter from a number of 
intermediate calculations. It is useful to relate the calcu- 
lations to the units in which the final estimate is to be 
reported, In a root sum square estimate, for example, 
which is to be reported to two decimal places, the 
squared terms should be calculated to at least four deci- 
mal places to achieve the required level of accuracy. 
From the opposite viewpoint, in terms of a root sum 
square estimate, if one or more of the squared terms 
were calculated to only two decimal places, it would be 
incorrect to report the final estimate to any greater accu- 
racy than one decimal place. 

All estimates, except repeatability, should be rounded 
up or down to the smallest unit of measure (rounding 
unit). As a result of its definition (see 13.1.7.1), a 
repeatability estimate should always be rounded up to 
the nearest rounding unit. 


13.1.8.4 EXAMPLE 


Consider the indirect measurement of the volume at 
standard temperature of liquid in a tank. This is to be 
estimated from a set of repeated gage readings, a calibra- 
tion table, a set of repeated temperature measurements, 
and a temperature correction formula. Each set of direct 
measurements (gage readings and temperatures) will be 
considered separately according to 13.1.8.1, and the ap- 
propriate statistics will be derived. These will then be 
combined to give estimates in terms of the liquid volume 
corrected for expansion in the tank resulting from non- 
standard temperature. 

For the purpose of this example, the procedures of 
13.1.8.1 will only be described in detail with respect to 
the set of gage readings. Statistics for the set of tempera- 
ttire data will be given. Note that statistics that are to be 
used at a later stage in the calculations will be stated toa 
greater level of accuracy (one or two more decimal 
places) than that achieved in the corresponding meas- 
urements. This is to ensure that the final estimate of vol- 
ume includes no rounding errors. Note also that the 
figures used in the example were chosen strictly for illus- 
trative purposes, and are not necessarily typical of those 
to be found in practice. 
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13.1.8.4.1. Direct Measurements 


In this section, the procedures of 13.1.8.1 will be ap- 
plied to the single set of tank gage measurements. This 
can be considered as separate steps as follows: 


Step 1—Information available. 


Six gage measurements x,, fori = 1 to 6 (see 13.1.8.1) 
were recorded to the nearest millimeter: 6534, 6544, 
6542. 6540, 6543, and 6544. Unknown systematic errors 
were expected as a result of sludge at the bottom of the 
tank and inaccuracy in the tank gage tape. These errors 
(see 13.1.8.1.3) recorded in millimeters as: 


Source of Maximum Range of Error 
Systematic Error e @, 
Sludge —4 0 
Tape -1 +1 


It is also known from an independent study that the 
repeatability for tank gaging was 7 millimeters. 


Step 2—Outlying results. 


The first gage reading differs from the others by what 
appears to be an appreciable amount. As a quick check 
on its validity, the critical range for the set of measure- 
ments, rounded to the nearest millimeter, is calculated 
from Equation 24 as: 


D(njr 
V2 X (tos, n-1) 


w= 


Where: 


n= 6, 

D(6) 2.543 (see Table 1). 
r= 7. 

2.571 (see Table 2). 


Nl 


ty5.5 


Therefore: 


ioe 2D. 
ne tos. a 

2.534 * 7 

2 X 2.571 

5 millimeters 


The observed range of 10 exceeds this value, so Dixon’s 
outlier test was applied (see Appendix B). The appropri- 
ate Dixon ratio for six measurements and for testing a 
low value is: 


Rw 6540 — 6534 
~ 6544 — 6534 
= 0.6 


which exceeds the critical ratio at the 95 percent 
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probability level. The first measurement was rejected as 
a faulty reading (outlier), and all following calculations 
disregard it. 


Step 3—Correcting for bias. 


According to Equation 4, the average systematic error 
due to tape inaccuracy is zero, but that for sludge is 
given by: 


ol 


of -44+ 0 
2 
—2 millimeters 


The results x; must be adjusted according to Equation 
1 to give the corrected measurements Y,, fori = 1 to 5: 
6546, 6544, 6542, 6545, and 6546. 


Step 4—Estimating true gage reading. 


This will be the average ¥ of the results corrected for 
bias (Equation 5), that is: 


7m 6546 + 6544 + ... + 6546 
. 5 
6544.6 millimeters 


Step 5—Estimating standard deviation. 


The standard deviation of corrected measurements 
can be estimated both from Equation 7 as: 


sy) = VCR + 0.8 + 26 + O04 + 14) 


1.67 


and from Equation 8 as: 


s(y) DH 
= thes 
~~ 2,326 


= 1.72 


Statistics derived from the second and more approxi- 
mate estimate will be given in parentheses for compara- 
tive purposes. 


Step 6—Estimating range of uncertainty. 


By substituting the standard deviation estimates into 
Equation 13, the limit of the range of uncertainty due to 
random errors becomes: 


ay) a= Hawal eu! 
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_ 2,776 X 1.67 
~ os 
2.07 (2.14) millimeters 

Since there are two unknown sources of systematic er- 
ror, the corresponding limits of uncertainty will be esti- 
mated for each according to Equations 15 and 16, 


respectively, as: 
0.95 x (4 =| 


limit due to sludge 5,(y) 


1.9 millimeters 
0.95 x |- 1| 
0.95 millimeter 


limit due to tape b,(y) 


Combining the systematic limits by the root sum 
square method (Equation 18) gives the total limit for 
systematic errors: 


by) = V1.9? + 0.95? 


= 2,12 millimeters 


The limits for systematic and random uncertainties 
should be combined in a similar manner (Equation 19) 
to give: 


cy) 


Vay) + BX) 
V20P = 212 


2.96 (3.01) millimeters 


Il 


Step 7—Estimating repeatability. 


To compare the variability within the set of measure- 
ments to that expected in practice, the repeatability can 
be estimated from Equation 23 as: 


(tos) x 2 x s(y) 
2.716 x /2 x 1.67 


6.6 (6.7) millimeters 


Rounding up to the nearest unit of measure of 1 milli- 
meter (see 13,1.8.3), the repeatability estimates would 
become 7 millimeters. This is identical to the value de- 
rived from the independent study. 


r 


I 


Step 8—Stating the result. 


The estimate of the true gage reading should be stated 
after rounding to the nearest unit of measure (see 
13.1.5.4): 


CY”) = 2.96 ~ 3 
The result statement thus becomes: 


True gage reading = 6545 + 3 millimeters (95 
percent confidence level, 5 
measurements) 


Note: One further result is rejected as a faulty reading (outlier). 
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Table 3—Derived Statistics for Example 


Gage Reading, Thermometer 
Value millimeters Reading, °C 
n 5 9 
y : 6544.6 23.38 
a(y) 2.07 1.362 
Ay) 2.12 0.500 


Table 4—Symbols for Example 


Estimate 
Measurement True Value Corrected for Bias 
Depth xX, My 
Absolute volume X, Ve 
Temperature X; Vs 
Corrected volume X, dy, 


13.1.8.4.2 Measuring Volume 


Next, the statistics derived from the two sets of direct 
measurements are combined according to the proce- 
dures of 13.1.8.2 as follows: 


Step 1—Information available. 


The information corresponding to the direct measure- 
ments can be summarized in the form of derived statis- 
tics as in Table 3. Let us also assume that the symbols 
allocated to each quantity are as listed in Table 4. 

The calibration table, by which a gage reading in mil- 
limeters can be converted to a volume in liters, was ob- 
tained from an unknown function of tank dimensions. 
No random error is created in the use of such a table, 
but an unknown systematic error is expected resulting 
from the approximate nature of the function. This was 
assumed to be level dependent, and the corresponding 
limit of uncertainty is estimated to be: 


A(X.) = 0.2% X, 


Finally, the function (see Equation 25) used to correct 
the volume for temperature and expansion in the tank 
is: 


xX, 


F(X,, X5) 
f(X,) X, [1 + 0.000022 (X, — 15)] 


Table 5—Volume Measurement Statistics for 


Example 
Gage Reading, Volume Measurement, 
Value milliliters liters 
a 5 5 
Ve 6544.6 17016 
aly) 2.07 5 
by) 2.12 6 


Where: 


{(X,) = a factor (read from tables) corresponding 
to a temperature X;. 


Step 2—Estimating absolute volume. 


The calibration table may be regarded as a means of 
converting a liquid depth measurement (gage reading) in 
millimeters to a liquid volume measurement in liters and 
may be represented by the function: 


X, = PX, 


P is nearly constant in this example. The statistics that 
relate to volume results should, therefore, be read di- 
rectly from the table. In this case, they are assumed to be 
those in Table 5. 

The systematic error brought about by inaccuracies in 
the table should be considered at this point. In terms of 
results corrected for bias, the corresponding limit of un- 
certainty will be estimated as: 


b{P.) = 0.2% Py 
0.002 x 17016 
34 liters 


The two limits for systematic errors that affect volume 
results are then combined by the root sum square 
method (Equation 18) to give: 


by.) = VbFO.) + 6? 2) 
VOr+ 3P 


35 liters 


Step 3—Estimating corrected volume. 


According to Equation 26, the estimate of correct vol- 
ume will be obtained by substituting estimates directly 
into the appropriate equation. If we assume that the 
temperature factor f(x,) is read from tables as: 


£05) f(23.38) 
0.98 (given) 


Il 


Then the true corrected volume is estimated to be: 


y, = 0.98 X 17016 [1 + 0.000022(23.38 — 15)] 
= 16678.9 liters 


Step 4—Estimating random uncertainty limit. 


The random errors for volume and temperature mea- 
surements are combined according to Equation 33. In 
our case, the derivatives of the function are: 

aF 


ax, 7 0:98 (1 + 0.000022 (X, ~ 15)] 


and 
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af — 0.98 x xX, x 0.000022 VEE FD IB 


aX, = 34.3 liters 
Substituting the estimated values, 


Note that the systematic error in temperature mea- 


J, = 23.38 (Table 3) surements makes only a small contribution compared 
J» = 17016 (Table 5) with that created by inaccuracies in the calibration table. 
For X; and X,, respectively, gives: Step 6—Stating the result. 
oF _ 0.98018 Combining the random and systematic components of 
aX, uncertainty by quadrature (Equation 19), the total un- 
a = 0.36638 certainty limit becomes: 
aA; 
cee ; a) = VF Ot FO) 
The total limit of random uncertainty will then be given (0) ro pa as y 3 ie 
by: = 34.6 liters 
aly’) = (0.98018 x 5)? + (0.36686 x 1.362)? Rounding to the nearest unit of measurement, which 


4.9 liters from the calibration table was 1 liter, the final statement 


will be: 


True corrected volume = 16,679 =: 35 liters (95 
percent confidence level, 


Step 5—Estimating systematic uncertainty limit. 


Systematic uncertainty limits should be combined in a 


fashion similar to random uncertainty limits according 5 gage measurements, 
to Equation 34. The total limit of systematic uncertainty 9 temperature 
will be: measurements), 


b G. = VO.98018 X 35) + (0.36686 X 0.5)? NOTE: One further gage reading is rejected as a faulty reading. @ 


Copyright por American Petroleum Institute 
Fri Jan 11 21:33:51 2002 


MPMS 13-1-85 


] o7g2e90 


APPENDIX A 
NORMAL (GAUSSIAN) DISTRIBUTION 


Consider a set of » repeated measurements x,, lying in 
the range a to b so that a <= x, < b. If the total range is 
split into p equal subranges of length dx = (b — a)/p,a 
frequency histogram can be drawn. The histogram (see 
Figure A-1) consists of a series of p contiguous rectan- 
gles, with base equal to the subrange dx and height pro- 
portional to the number of measurements falling in that 
range. 

The height of each rectangle could just as easily repre- 
sent the proportion of the total number falling in the 
subrange or the relative frequency. The total area of the 
histogram would then be 1, and the area in each rectan- 
gle would become the probability of a measurement fall- 
ing in the subrange. 

Now consider the number of measurements ” becom- 
ing very large, and the length dx of each subrange be- 
coming very small. A continuous line drawn through the 
midpoint of the tops of each rectangle, which represents 
the relative frequency of measurements, would give a 
bell-shaped curve similar to Figure A-2. 

For the normal distribution, the curve is symmetrical 
about the mean and has the formula: 


1 (xp? 
f{~) = exp ( ) 
oV2r /2a 
wo WwW 
ae 
Oni 
at 
aps 
uu 


Where: 


o = Standard deviation. 


The area under the curve once again represents 
probability. Each of the shaded regions shown has an 
area of: 


poe +00 
P= f f(x) dx = f(x) dx 
— 00 +e 


When ¢ = 1.960, the probability P (one shaded area) 
will be 0.025, or 2% percent of the total area under the 
curve. 

Now if measurements x, follow the normal distribu- 
tion with mean » and standard deviation o, then values 
p, will follow a normal distribution with zero mean and 
unit standard deviation where: 


The value p, is termed the standard normal deviate, and 
has been tabulated for different probabilities P. For a 


A RANGE OF MEASUREMENTS B 


Figure A-1—Frequency Histogram 
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aK RRR 
RNY) QE 


pos ft B+ 
Figure A-2—Bell-Shaped Curve 
probability P = 0.05, however, the standard normal de- includes all values of x which differ from the mean p by 


viate has a value 1.96. This probability is represented by more than 1.96. 
both shaded areas in the distribution (Figure A-2) and 
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APPENDIX B 
DIXON’S TEST FOR OUTLIERS 

The following steps should be followed (see Table B-1—Dixon's Test for Outliers 

Table B-1) to use Dixon’s test for outliers. = aan 
Number of Critical Values Test Criterion 

1. Arrange the set of measurements x, in ascending or- Values,n P=095 P= 0,99 Low Values _ High Values 
der of magnitude x,, x2, ... Xp. 3 0.941 0.988 
2. Choose the appropriate test criterion, depending on ; fee id fee foe ee te 
the value of n and whether the measurement in question 6 0.560 0.698 * XX XH 
is low or high. 7 0.507 0.637 
3. Calculate the Dixon R ratio. If this exceeds the criti- 8 0.554 0.683 _ - 

. ws 9 0.512 0.635 R, = 22% of BOX 
cal ratio at the 5 percent probability level (P = 0.95), 10 0.477 0.597 a aes Xn 
then the measurement in question is highly suspect and an 0576 0.679 
could possibly be rejected. ) 0.546 0.642 Ra = ara or —— 

If the critical ratio at the 1 percent probability level (P 13 0.521 0.615 re va —— 

= 0.99) is exceeded, then the measurement in question : ae on 

should be discarded. 16 0507 0595 

. é . 17 0.490 0.577 

When a measurement is rejected, the outlier test 18 0475 0561 
should be repeated. 19 0.462 0.547 R, = aaa op Xn Xne 
20 0.450 0.535 ee ae ear 

21 0.440 0.524 

22 0.430 0.514 

23 0.421 0.505 

24 0.413 0.497 

NOTE: The two suffixes in the Dixon ratio refer to the differences in as nae os 

the numerator and denominator respectively. SOURCE: Biometrics, Vol. 9, p. 89, 1953. 
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